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SUMMARY 

Objective 

The objective of this report is to document the development and norming of parallel forms of the Air Force Reading 
AbiliticsTest (AFRAT). 

Background/Rationale 

The Air Force has been administering various commercially published reading tests to military personnel. These 
tests have been used for assignment of personnel to remedial training programs, as aids in career counseling of students, 
or for description of reading levels of airmen in various occupational specialties. A previous study on service applicants 
found large divergence in reading grade levels (RGLs) estimated from different commercial tests. The evidence suggested 
that RGL standards differ considerably from one commercial test to another. In addition to varying norms, the use of 
commercial tests has several other drawbacks, including high testing material costs and RGL norms of unknown 
appropriateness for military personnel. The goal of this effort was to develop reading tests with appropriate norms. 

Approach 

A total of 12,938 airmen was administered two reading tests (e.g. , either two forms of the AFRAT, or one AFRAT 
form and a commercial reading test). Analyses were computed to determine the equivalence of the AFRAT forms, their 
correlation with other reading tests, AFRAT raw score to RGL equivalents, and training grade validity of AFRAT item 
types. For establishing AFRAT-to-RGL equivalents, RGL standards were defined as the average RGL equivalent from 
sev eral commercial reading tests. 

Specifics 

The AFRAT consists of 45 vocabulary items in a synonym format and 40 comprehension items consisting of one 
or several paragraphs followed by one or more questions. The comprehension items require either paraphrasing or making 
inferences from the passages. All items are multiple-choice with four alternatives with a total test time limit of 50 minutes. 

Comparing AFRAT Forms A and B, the proportion of correct item responses was ,85 for each form, and average- 
item-to-test-tolal correlations were similar. In addition, subtest and total-score variances for AFRAT Forms A and B 
were equal. These data indicated that the two forms were parallel. 

The relationships of the two AFRAT forms to three commercial reading tests were moderate-to-high (correlations 
of approximately .60 to .67). The interrelation between the two AFRAT forms was somewhat higher (approximately .73). 
Since the sample population was restricted due to prior enlistment screening, the correlation between the AFRAT forms 
would be considerably higher if computed from a full-range sample. 

Percentiles were computed for AFRAT scores and RGL scores derived from commercial tests. AFRAT forms were 
equated to an average RGL through use of the Air Force General Aptitude Index (AI) from ASVAB as an anchor test, 
Raw-score-to-RGL conversion tables for the 4th through the 12th RGL were generated for AFRAT subtest and total 
scores. 

The median coefficient of correlation of AFRAT with technical training grades in 19 Air Force specialty groups 
was , 40. In 16 of the 19 groups, this correlation coefficient was greater than .30. 
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Conclusions/Recommendation 



Forms A and B of the AFRAT were found to be parallel. The computed percentile and RGL norms should be 
appropriate for enlistees. The AFRAT was found to correlate quite Well (approximately .60 or higher) with three 
commercial reading tests. A preliminary analysis indicated that the AFRAT would be a valid predictor of performance 
in technical training. 

It is recommended that the AFRAT replace the various commercial tests now being used as the operational test 
to screen enlistees for marginal or inadequate reading ability. 
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READING ABILITIES TESTS: 
DEVELOPMENT ANT) NORMINC FOR AIR FORCE USE 



I. INTRODUCTION 

Main Air Force organizations have been administering various commercially published reading tests to 
military personnel. These tests are used for assignment of personnel lo remedial reading training programs, as aids 
in career counseling of students, or for description of reading levels of airmen in various occupational specialties. 
The Tests of \dull Basic Education (TABE) (CTB/MeGraw-HilL 1976) comprise the reading lest instrument most 
frequently used in the Air Force. 

One of the problem* resulting from the use of different reading tests in the Air Force is the noted variation in 
computed reading grade levels (RGLs) for individuals w-th similar levels of intellectual functioning. A sludv on 
service applicants (Mathews, Valentine, & Selhnan, 1978) found considerably divergent RGLs from difft nl 
commercial tests for subjects of the same Armed Services Vocational Aptitude Battery (ASVAB) ability level. In 
addition, results indicated that the ASVAB Generajl (called General-Technical by some military services) composite 
correlated as highlv with some reading tests as (hose reading tests correlated with each other. Based on these 
results, the use of ASVAB lo estimate reading ability of groups was considered. However, there are some problems 
associated with using ASVAB composites lo measure reading ability of individuals. These cotnposiles contain 
several shorl subtests covering different ability (factors. The General composite includes Arithmetic Reasoning 
(\R), in addition to the verbal subtests of Wor/d Knowledge (WK) and Paragraph Comprehension (PC). Most 
women perform slightly better than do men on verbal tfsts; however, they generally do somewhat less well on AR 
than do men. When the General composite is used lo gauge reading ability of women, underestimation will result in 
the majority of cases. For individual measurement, therefore, a more con ten I specific and reliable measure of 
rending than that based on ASVAB was desired. 

The use of commercial lesls has several additional drawbacks, including high lesling material costs and RGL 
norms of unknown appropriateness for military personnel. To resolve these problems, it was decided that a reading 
lest should be developed specifically for Air Force use. The objective of this reporl is to provide a description of the 
development and nortning e' .lie Air Force Reading Abilities Test (AFRAT) lhal was designed lo standardize the 
a>sessmenl of reading abi««iy of Air Force personnel and lo replace the commercial reading lesls lhal have been 
used throughout the Air Force. 

II. METHOD 

Design Goals for AFRAT Form*? 

The following general goals were pursued in developing reading lests: 

1. Vocabularv and comprehension sections, as found in most commercial reading lesls, were designed. 

2. Comprehension passages were written with expository prose. 

3. Comprehension questions covered factual mailer lhal was unlikely to be answered correctly based solely 
on prior knowledge. 

t. Vocabulary words were selected which might likely be encountered in a work environment. Esoteric 
adverbs and adjectives were avoided lo keep the test from being overly academic in nature. 

5. The lest was designed lo be as reliable as possible but lo require less than one hour of lesling lime. 

Reading Measurement Instruments 

The following reading lesls were used in this study. 



!/••« IT Form X. \n experimental form of AFRAT was constructed based on available ilems from obsolete Air 
Foree classification tesls. This tesl was used lo obtain initial estimations of the construct and predictive validity of 
the item types. Due lo the limited pool of items, the difficulty of AFRAT Form X items varied considerably from 
very easy to very hard. 

.tFRAT I'orms I «/«/ ft Two parallel AFRAT forms (A and B) were developed. The second form allows for 
rclcsling after remedial training. Items wm selected from a pool assembled specifically lo specifications for 
\I RAT. The AFRAT consists of vocabulary items in a synonym formal and comprehension items consisting of one 
or several paragraphs followed bv one or more questions. The comprehension items require either paraphrasing or 
making inferences from the passages. AFRAT Forms A and B each contain 15 vocabulary and M comprehension 
items, with a total lest limit of 30 minutes (see Table 1). All items are multiple choice with four alternative*. Hie 
tests were targeted at the 8th RGL as measured by the Adult Basic Learning Examination (sec the followug 
paragraphs). Although AFRAT Forms A [ind B were lo be peaked at a difficulty level corresponding lo the 8th 
Rt;l.. the desired norms would span from ihe 5lh through the I2lh RGL. 



Tublr I. Test Lengths and Times for AFRAT Forms 

AFRAT A-B AFRAT X 

. , i 



Time Time 
SciJ( . No. Items (Minutes) No. Item* (Minutes) 



Vocabulary 45 15 50 10 

Comprehension 40 35 42 — > 

Tciial * 85 30 02 35 



(iates-WacCiniiie Reading Tests (Survey D). Included in these tests an> a 50-ilem vocabulary section and a 
12-ilem comprehension section, wilh a combined testing time of 10 minutes. The vocabulary items reqirrc the 
selection of synonyms for single words. The comprehension items consist of qucsli""* .'bout single paragraphs. 
\ocabular\ .iinl comprehension RGl.s of 1.0 to I 1.0 are reported (Gales & Mac'^i \u, . r>W 

/es/s of Ulull Busk Education (TtHE) Level I). This instrument consists of a Ui-iu-m v#v; binary section and 
i 13-ileni comprehension section, with a combined lime of 50 minutes for testing. '1 *ie voca *, uiry ilems queslion 
the meaning of word- in phrases. The comprehension items consist of questions aboi' j asv^ges containing one or 
several paragraphs. Vocabulary, comprehension, and combined RGLs of from T.O to 12.0 are reported (CTB/ 
McGraw-Hill, 1076). 

iduh Basic Learning Examination (ABE) Levels /-///. This instrument includes a 50-ilem vocabulary 
section and a 58-ilem comprehension section taking approximately 50 minulesof testing time (litis varies by level). 
The vocabulary items ask about (he meaning of words in phrases. The comprehension items consist of questions 
about single paragraphs. Vocabulary, reading, and problem solving sections were used lo calibrate ihe AS\ AB 
General Composite lo ABLE in an unpublished Army study completed in 1080. ABLE gives RGLs from 3.0 lo 12.0 
(Karl-en. Madden, & Gardner. 1071). 

Samples 

\ total of 0,535 subjects tested from May to July 1081. except as noled, formed ihe following seven samples: 

1. 025 \ir Foree trainees given AFRAT Forms A and B. 

2. 820 Air Force trainees given ABLE II and AFRAT Form A ( - 113) or \FRAT Form B (N - 107). 

3. 016 Air Foree trainees given Gates-MacGinitie and AFRAT Form A (N - 151) or B (N = 102). 

1. 883 Air Force trainees given AFRAT Form \ and AFRAT Form A (N - 150; or AFRAT Form B ( v 121). 

■ ft H 



."). 3.27 I \ir lore- trainee given TABE and AFRAT Form 'A (N -1951. composed of subjects from sample 
1 - D e.r AFRAT Form B (N - 1918. composed of subjects from samples 1 - t less 625 subjects given both forms). 

0. 1.0 10 Army trainees given AFRAT Form A (N = Wl) or AFRAT F< rm B (M = 558). 

7. 2.232 Air Force trainees given AFRAT Form X in 1 ( >78. 

In addition, data based on about 1.100 Army trainees given ABLE I. II. or III in 1980 and 2.053 service 
applieanis given Gales-MacGinilie in 1078 (Mathews. Valentine. & Selln.ai.,1978) were used in developing norms. 
These two tests and the TABE are widely used by the armed services. 

Analytic Methods 

\n item analysis program (Koplvav. 1981) was used to compute the following internal AFRAT and test 
summary statistic Difficulty (proportion answering each item correctly), item biserial (correlation of Hem with 
lest scale), internal consistency reliability (Ruder-Richardson Formula 20). lest mean, and standard deviation. 
Means for Army samples were adjusted in order to control for lest score differences resulting from sampling 
fluctuations. This was accomplished by using regression equations (Guilford & Fruchter. 1978) for predicting 
AFRAT scores based on the relationship of AFRAT forms with the ASVAB General composite. 

Construct and predictive validities of AFRAT form* were assessed through Pearson correlation coefficient (r) 
values, which were compute ! among tests. Predictive validities were obtained by correlating AFRAT Form X 
scores with technical training grades for .subsamples. Fisher's r to z transformations were used to average the r 
values across combined samples (Guilford & Fruchter. 1978). The technical training validation wa.s only a 
preliminary analysis as a more comprehensive study will be done on AFRAT Forms A and B when sufficient 
criterion data are available. 

Percentile norms were obtained for AFRAT forms, and AFRAT Forms A and B were placed on the same scale 
through equipercentile equaling (Angoff. 1971). This same procedure was used to equate AFRAT to TABE RGLs 
\FR \T Forms \ and B were also equated to ABLE and Gates-MacGinilie RGL scales through the use of the AS\ AH 
General composite as a common anchor lest. This is the Angoff (1971. p. 57(») Design III where all groups lake the 
common anchor tesl, and each group takes one of the reading tesls. 



III. RESULTS AND DISCUSSION 

AFRAT internal Analyses 

Table 2 gives the item difficulties for AFRAT Forms A and B based on Air Force trainees given both 
(Sample 1). These alternate forms appear to be of parallel difficulty, with fairly similar means and distribul 
The bulk of the items are quite easy with means around ,82 (not corrected for guessing). In comparison, the 'I 
ileitis had an average difficulty of .81 for the same sample (N = (>25). 

Table 2. Distribution of Difficulties (P) for AFRAT Forms A and B Items 

(N =~625) 



Vocabulary Comprehension 



Difficulty* 


A 


B 


A 


B 




18 


1() 


8 


15 


80 -80 


12 


11 


19 


15 


70-70 


8 


<) 


8 


5 


to) -60 


a 


5 


5 


2 


50 and less 


if 


1 


> 


3 


Tola! 


15 


15 




to 


Average P 


.798 


.796 


.8'^ 


,833 



a Decimal points omit led for readability. 



Tiif iimi-le*! hiMM iiil correlations (r^ ) an' iiKid«-ratc>-lc>-liigli for virtually all items, willi means of tbe r 
valuta for *ublesls of .00 to .(if) and an item r range of .29 lo .89. Again, the AFRAT forms appear parallel (see 
Table .*). 

Ttiblv •>. Distribution of AFRAT Forms A and B Item -Test Correlations 

(N - 625) 



Woabulary Comprehension 



a 

- r bis 


A 


B 


A 


B 


70-00 


<) 


11 


13 


15 


50-60 


28 


10 


16 


17 


30- 1«) 


8 


It 


11 


8 


20 and less 


0 


1 


0 


0 


Total 


15 


t5 


10 


to 


f" -Ins 


.508 


.507 


.613 


.6 tO 



"Decimal point* omitted for readability- 



An estimate ot mean AFRAT item performanee for snbjeels equal in average ability lo that of the normative 
population for ASVAB ean be obtained from the data eolleeled on Army trainees (Sample 6), Army samples given 
AFRAT bad an average ASV \B (General eoniposite score of about 50 percentile. 50.6 for AFRAT A sample and 19 J 
for AFR \T B sample. Mean AFRAT difficulties (P) for these subjects (sample 6) are given in Table I. Because the 
lowest ability subjects are excluded from service, the distribution of scores would differ in an applicant or 
normative sample. The average P value was .6°* for the Army samples compared to the P value of .82 for Air Force 
samples. Since ASV AB selection tests hav e P values of about ,70 (Ree, Mullins, Mathews, & Massey, 1982), AFRAT 
seems to be comparable in mean difficulty lo these tests. 



Table 4. Mean AFRAT Difficulties for Army Samples 







AFRAT Form 






A 


B 




Scale 


(N = 491) 


(N = 558) 


Average 


Vocabulary 


.69 


.66 


.68 


Comprehension 


JO 


JO 


.70 


Total 


.70 


.68 


,69 



AFRAT internal consistency reliability coefficients are shown in Table 5 for subgroups of Air Force samples. 
These data are based on all female and all Black trainees and on representative subsamples of male and Caucasian 
trainees. The average reliabilities were .92 for AFRAT Form A and .914ftr AFRAT Form B. These values-are quite 
high considering that reliability is maximized when item variance is large (i.e., when item difficulties are 
moderate). 



13 



10 



Table .). AFRAT Reliabilities for Air Force Subgroups 



Form 






Subgroup 






Black 


(Caucasian 


Female 


Male 


AFRAT A 


N 


520 


520 


731 


731 




Rei 


.<)2 


.<)2 


.89 


.92 


\FRAt B 


-M 


310 


510 


736 


736 




Rei 


.02 


.90 


.87 


.94 



Note. Internal consisleiiey reliabilities (Rei) based on formula KR-20. 

Reliabilities were not as high for female samples, .89 for AFRAT Form A and .87 for AFRAT Form B. This is 
most likely due to significantly lower score variance for women compared to men (F = 1.6, p < .01 on AFRAT Form 
A and F = 2.3, £ < .01 on AFRAT Form B). At least two plausible explanations for the gender difference in score 
variance exist. First, the mean AFRAT scores were 2.5 points higher for women than men, thus restricting the range. 
Second, some previous studies of aptitude/achievement tests have revealed higher male variance on a number of tests 
(Jensen, 1980). 

Test Intercorrelations 

Table 6 shows the intercorrelations for tests given to Air Force subjects in sample 5. These r values have not been 
corrected for restriction in range from selection on the ASVAB since it is doubtful that assumptions required to make 
corrections can be met. Despite the attenuation, the alternate AFRAT forms correlated .73. The degree of restriction 
in these r values is illustrated by visually comparing the r of .57 in Table 6 between Gates-MacGinitie and ASVAB 
General to the r of . 76 obtained between the same two measures in a study using service applicants (Mathews, Valentine, 
& Sellman, 1978). The average revalues for other tests was .65 for AFRAT A and .63 for AFRAT B. These AFRAT 
forms correlated somewhat more highly with other reading tests than did the TABE. The average r values for AFRAT 
and TABE were .65 and .57 with Gates-MacGinitie, respectively, and .62 and .50 with ABLE, respectively. The two 
AFRAT forms correlated to the same degree with TABE as they did with the GM and ABLE (average r = .64) with 
both AFRAT and TABE. 

Table 0, Test Intercorrelations 
(Samples 1-4, N Values Range from 407 to 3,274) 



Test 


AFRAT A 


AFRAT B 


AFRAT X 


TABE 


GM 


ABLE 


GT 


\m vr \ 


1.00 


.73 


,M 


.67 


.66 


.61 


.63 


\I RAT » 




1.00 


.65 


.61 


.61 


.60 




\FR \T \ 






1.00 


.56 


a 


a 


.63 


r \bk 








1.00 


.57 


.50 


.60 


(iiiies-MacGmilie 










1.00 


a 


.57 


YBI.K 11 












1.00 


.10 


General ( \SVAB) 














1.00 


a l)ue in sampii 


constrain!- 1 hese 


inirrcorrelaiions 


are unavailab 


le. 









Table 7 gives itllereorrelaliotis of similar subtests across reading tests. Among vocabulary subtests, tlie liigliesl 
r. .68. was between the two AFRAT forms. For comprehension subtests, the r between AFRAT Forms A and B. .62. 
was again the highest. Correlations among comprehension tests were generally lower than the r values among 
vocabulary tests. This would be indicative of more unique variance within the different comprehension tests than 
within the different vocabulary tests. 
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Table 7. Intereorrelations of Like-Warned Subtests 
(N Values = 407 to 3,274) 



Test 


AFRAT A 


AFRAT B 


AFRAT X 


TABE 


G-M 


ABLE 


Vocabulary 














VFRAT A 


1 .00 


.68 


.53 


.57 


.67 


.62 


VFRAT B 




1.00 


.52 


.18 


.61 


.52 


VFRAT \ 






1 .00 


.11 


ii 


a 


T VBE 








1.00 


55 


.11 


GaU's-Mardiiitie 










1.00 


n 


VBLE 11 












1.00 


Comprehension 














VFRAT A 


1.00 


.62 


.W 


.50 


.10 


.37 


VFRAT B 




1.00 


.53 


.16 


.38 


.16 


VFRVT \ 






1.00 


.18 


ii 


ii 


T VBE 








LOO 


.37 


.28 












1.00 


a 


VBI.EI1 












1.00 



a l>ue 'n sampling ron*irainu ilu»M' mU»rrorn»lahons are unavailable. 



AFRAT Norming 

Descriptive statistics for AFRAT Forms A and R are listed for Sample 1 in Table 8. AFRAT means and 
standard deviations for Army samples are given in Table 9. Adjusted means are also shown based on regression to 
compensate fornbilily differences on the ASVAB General composite. These differences noted earlier are eaused by 
sample fluctuations. These means, 58.6 for AFRAT Form A and 58.1 for AFRAT Form B, should be representative 
since these samples had the same average ability as the normative population. However, as previously mentioned, 
the distribution of scores in the general population would differ. 



7 «/>/<* «. AFRAT Forms A and B Means, 

and Standard Deviations (SD) 
(N = 625) 

Test M«*n SD 
AFRAT A 

Vocabulary 35.9 5.5 

Comprehension 33.1 5,8 

Total 69.0 10.2 

AFRAT B 

Vocabulary 35.8 5.3 

Comprehension 33.3 5.3 

Total 6<U 9.4 
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luhlc { K AFRAT Means and Standard Deviations (SD) 
for Army Samples (N = 491 and N = 558) 



AFRAT A AFRAT B 



Scale 


Mean 


SI) 


Mean 


SD 


\ (M almlai \ 


31.1 


9.5 


29.7 


0.6 


( loiuprrl'i'iMoii 


28.(1 


8.3 


27.8 


8.8 


loa! 


59.1 


10.8 


57.5 


17.2 


\ilju>!ed Toial a 


:>&(> 




58.1 




a ViIju-!ftl \ui it^rr^mn lor; 


1 1) 1 1 1 1 \ ilifiVri'iu'r-* 


on vsyab<;< 


nrral (lomposilr 





>,UHJ>lllifi. 

AFR VT Forms A and B were equaled using the equiperrentile method with Air Foree samples. Because llie 
form* appear parallel, the raw scores were combined to compute percentiles and to give a single, more stable 
conversion table (see Table 10). At every percentile point, raw scores for AFRAT Forms A and B are within one 
pom! of the average raw score. 

Table 10. Equipercentile Equating of AFRAT Forms A and 
B for Air Force Samples 





AFRAT A 


AFRAT B 


A-B Avg a 




Percentile 


Raw 


Raw 


Raw 


Percentile 


1 


to 


11 


40 


30 


o 


15 


16 


16 


32 


3 


18 


19 


18 


31 


1 


19 


51 


50 


36 


5 


51 


53 


52 


38 


(> 


52 


51 


53 


10 




51 


55 


51 


12 


8 


55 


56 


55 


11 


9 


3d 


57 


56 


16 


1(1 


57 


58 


57 


18 


12 


58 


59 


59 


50 


11 


59 


60 


60 


55 


In 


dO 


62 


61 


(>() 


18 


61 


63 


62 


65 


20 


62 


63 


62 


70 


22 


63 


61 


63 


75 


21 


61 


65 


61 


80 


26 


65 


65 


65 


85 


28 


66 


66 


66 


90 

95 



AFRAT A AFRAT B A-B Avg 



Raw Raw Raw 



66 67 66 

67 67 67 
(>8 68 68 
69 68 68 

69 69 69 

70 69 70 

70 70 70 

71 70 71 

71 71 71 

72 71 72 
72 72 . ,' 72 
75 75 75 

75 75 ! 71 

76 74 75 

77 75 76 

78 76 77 

79 77 78 

80 78 79 

81 79 80 
85 81 82 



'AH entries have been rounded to integer form. 



Tlte AFRAT is negatively skewed (i.e.. tlie raw score distribution is skewed to llie left), which is appropriate 
for a lest designed to identify low-performing subjects. The AFRAT median score (50lh percentile) was 72. 
compared to a mean of about 69 (from Table 8). A higher median than mean is characteristic of negatively skewed 



tests 



AFRAT percentiles for Army samples are listed in Table 11. The median score was about 62. compared to a 
mean of 58. 
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Tnhh' 1 1. Kquipercentile Equating of AFRAT Scores for Army Samples 



Percentile 


Ar at A 1 


Percentile 


APR AT 
ArnA 1 


A - B Average* 1 


A - B Average 


1 


21 


SO 


51 




o o 




52 


3 


23 


3i 


33 


1 


21 




53 


5 


25 


38 


56 


0 


26 


10 


37 


» 


28 


12 


58 


B 


20 


11 


5<) 


( ) 


M 


16 


60 


10 


33 


18 


61 


12 


35 


.)() 


62 


1 t 


38 


35 


6 J 


1(» 




60 


66 


18 


n 


63 


60 


20 


is 


70 


71 


>•> 


r> 


73 


72 


21 


K) 


80 


75 


20 


18 


83 


76 


28 


;>o 


<)0 


78 






05 


81 



Ml eutrie* liave been rounded lo integer form. 



Table 12 contains an equipercenlile calibration of AFRA^ scores to ASVAB General (or General-Teelinieal) 
composite percentiles based on combined Air Force and Army subjects (Samples 5 and 6), The General composite 
is the AS\ AH measure which lias been found lo correlate highest with reading tests (Mathews el al.. 1078). 



Table 12. Equiperc entile Calibration of AFRAT Form A-B 
Average Scores to ASVAB General Composite 



General Compoaite AFRAT Raw Score 



10 


18 


15 


20 


20 


23 


25 


34 


30 


45 


35 


55 


40 


58 


45 


62 


50 


65 


55 


68 


60 


71 


65 


73 


70 


75 


75 


76 


80 


77 


85 


79 


90 


81 


95 


83 
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FquipereeiUile calibrations of i>liu*r reading tests to \SVAB general percentiles are shown in Table 13. The 
data on \B1.K and UUes-MacCinilie are based on previous studies (see "Samples'* subsection), and the TABE data 
are from Air Force Sample .1 in this study. 



Tnblv /.>*. Equiperccntilc Calibration of Reading Tests 
to General Composite Percentiles 



VSVAB 
General 


ABLE 8 
Grade I. 


TABIC 

rgl 


0-Mb 

r(;l 


Average 
RGL 


10 


.i.t 




— — — ■ 

t.o 


1.7 


1 ) 


6.3 




ti<) 


0.1 


20 


7.0 


0.0 


0.0 


6.0 


J.) 


7.0 


1 . 1 




7.r> 


30 


8.0 


8.7 


7.0 


8.2 


.■r> 


8. 1 


0.7 


8.9 


0.0 


in 


8.7 


0.0 


0. 1 


0.3 


Li 


0.0 


10.1 


0.0 


0.7 


,10 


0.1 


10.0 


10.1 


10.1 


.1.1 


0.8 


11.0 


10.0 


10.0 


00 


10.1 


11.1 


11.2 


11.0 


o;> 


10.7 


11.8 


11. .1 


11.3 


To 


11.1 


12.2 


11//" 


11.7 


7.1 


1L1 


12.5 




12.0 


80 


11.7 


12.8 




12.2 


8.1 


12.0 


I2.0 a 




12.1 


00 


12.3 






12,0 


0.1 


12.7 






12.0 



Ba>ed cm data from Army MibjeeN it»v,it»<l in 1 ( )80. 
'Ba-cd mi reiionnmg of data from 1078 siudv. 
' Maximum U(il. from normaiive i iil>! i> 1 1.0, 



It is apparent that there are substantial differences in grade level norms among the commercial reading tests. 
\i *orne specific levels, at least one grade separates each of the reading tests from another. Without substantial 
evidence as to which lest yields the most accurate R(iL conversions, a good estimate should be obtained by 
averaging the R(il> across the commercial tests. The column on the right side of Table 13 gives this average. 

Equipercentile conversions of average RilL for each AFRAT total raw score point are shown in Table 11. 
Separate R(*L conversions tor AFRAT Vocabulary and Comprehension subscores arc listed in Table 1.1. 
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' 7 ii Me 14. AFRAT Forms A and B Total Score Conversions 
to Beading Grade Level (RGL) 



ArKAl lotal 




APR AT Tntal 




1-1") 


1.0 


51 


8.6 


1() 


1.2 


52 


x 8.7 


17 


I.I 


53 


8.8 


18 


JkT 


51 


8,9 


19 


5.3 


55 


9.0 


20 


0.1 


56 


9.1 * 


21 


d.5 


57 


9.2 


22 


d.T 


58 


9.3 


23 


6.9 


5') 


9.1 


21 


6.9 


W) 


9.5 


2"y 


(i.9 


61 


9.6 


It) 


7.0 


62 


9.7 


>" 


7.1 


63 


9.8 


28 


7.1 


61 


10.0 


29 


7.1 


65 


10.1 


30 


7.2 


66 


10.3 


31 


7.3 


67 


10.5 


32 


7.3 


68 


10.6 


33 


7.1 


69 


10.7 


31 


7.5 


70 


10.8 


35 


7.0 


71 


11.0 


,u> 


i , j 


72 


11.1 


37 


7.8 


73 


11.3 


38 


7.8 


71 


11.5 


30 


7.9 


75 


11.7 


10 


7.9 


76 


12.0 


11 


8.0 


77 


12.2 


L 


8.0 


78 


12,1 


13 


8.1 


79 


12.1 


u 


8.1 


'', 80 


12.5 


15 


8.2 


• 81 


12.6 


10 


8.2 


1 82 


12.7 


17 


8.3 


83 


12.9 


18 


8.1 


81 


12.9 


W 


8.5 


85 


12.9 


50 


8.5 







1.9 
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Ttiblr / ). AFRAT Vocabulary and Comprehension 
Reading Grade Level (RGL) Conversions 



Vocabulary 
Score 


Average 





Comprehension 
Score 


Average 
RGL 


1-7 


— — — — 

1.0 




I - i 


I (1 


8 


1 M 




\\ 


U 


0 


1. J 




{) 


3.0 


10 


5.0 




1 i\ 


0.1 


! ! 


fx 1 






6.8 


12 


< .0 




1 M 


0.8 


13 


* .2 




1 4 
1 .1 


0.0 


1 1 


*"' i 
t . 1 




1 J 
1 1 


7.0 


1") 


7.") 




1 •) 


7.1 


10 


7.0 




1 ft 
1 l) 


7 ° 


IT 


i .8 






7.3 


18 


7.0 




1 O 


7. 1 


lo 


8.2 




1 O 


< .•) 


;in 






on 


7.0 


21 


8. 1 




O 1 

J. 1 


7 8 


•) ) 


8. 1 




oo 


8.0 


23 


8.3 




•vi 
~.> 


8.2 


21 


8.3 




M i 


8. 1 


2.1 


8.0 


• 


2.) 


8 7 
(>. < 


20 


8.8 




o/» 
~0 


8.0 




8.0 




2 i 


0.1 


28 


o.o 






0.3 


2*' 


o.l 




20 




30 


0.2 




30 


0.0 


31 


0. [ 




1 1 


0.0 


32 


0.0 






10.2 




0.0 




33 


10.3 


3 \ 


Ml.. J 




3 1 


10.7 


33 


1 0.0 




. J. ; 


1 1.1 


Mi 


1(10 




3() 


11.0 


1 1 




'47 


12.2 


' • 

38 


lit" 




38 


12.1 


30 


11.8 




30 


12.7 


10 


12.2 




10 


12.0 


11 


12..) 








12 


12.0 








13 


12.8 








1 1 


12.0 








13 


12.0 









Technical Training Validation 

In order lo pel' an initial estimate of I he predictive validity of the item types in AFRAT. Form \ was 
administered to approximately 3.000 airmen.. Technical training grades were subsequently obtained for those in 
common V.r Force Specially Code (AFSC) groups. Validities for AFRAT Form X in \<) AFSC groups (total N = 
2.233) are listed in Table id. The median r with training grades was .10. Validities were generally higher for 
Comprehension than for Vocabulary. This is lo be expected due to selection on the ASVAB General composite 
which has more vocabulary than reading comprehension content. This would severely restrict r value involving a 



.20 



vocahnlan lest given after qualifying on ASVAB. A more complete validation study involving AFRAT Forms A and 
B will be accomplished when criterion data are obtained for sufficiently large samples. 

Table 16. AFRAT Form X Validities' for Technical Training Grades 



AFSC Code b N Vocabulary Comprehension Total 



276 


42 


.36 


.31 


.39 


304 


91 


.52 


.58 


.61 


326 


57 


.27 


.37 


.36 


423 


178 


.26 


.40 


.38 


426 


151 


.40. 


.29 


.43 


431 


217 


.26 


.35 


.35 


161 


84 


.32 


.49 


.45 


462 


48 


.21 


.48 


.44 


54X 


66 


.31 


.41 


.41 


55X 


69 


.17 


.09 


.13 


571 


67 


.33 


.41 


.40 


605 


50 


.10 


.22 


.19 


631 


84 


.27 


.50 


.47 


645 


148 


.28 


.40 


.38 


702 


376 


.24 


.31 


.31 


732 


38 


.15 


.40 


.34 


811 


294 


.39 


.43 


.45 


902 


134 


.35 


.38 


.40 


922 


38 


.34 


.14 


.26 


c Not corrected for range restriction. 








h AFSC code 


is used to identify clusters 


of highly similar jobs. 







IV. CONCLUSIONS AND RECOMMENDATIONS 

Two parallel forms of the AFRAT have been developed and calibrated to three commonly used reading tests and 
appear to meet administrative and psychometric specifications. All items correlate positively with total test score and 
are in an appropriate range of difficulty (from average to very easy) for use in detecting reading deficiency. 

The AFRAT appears to be a highly reliable instrument (internal consistency coefficients of .92 for Form A and 
.91 for Form B). The two AFRAT forms appear parallel based on similar distributions of item difficulty and criterion 
correlation values and statistically equivalent means and variances. AFRAT correlated .60 or higher with each of the 
three commercial tests. 

Interpretation of AFRAT scores is provided by percentile norms and calibration to an average RGL based on 
tile commercial tests. A calibration is also presented with ASVAB General percentile scores. A preliminary analysis 
indicated the AFRAT would be a valid predictor of technical training performance. 

It is recommended that AFRAT Forms A and B replac e commercial reading tests for use in screening enlistees 
for marginal or inadequate reading ability. 
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